SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANTS: Liu, Qingyun 
Chen, Fang . 

(ii) TITLE OF INVENTION: Human Uncoupling Protein 3 
(iii) NUMBER OF SEQUENCES: 21 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Merck & Co., Inc. 

(B) STREET: P.O. Box 2000 RY60-30 

(C) CITY : Rahway 

(D) STATE: NJ 

(E) COUNTRY: US 

(F) ZIP: 07065-0907 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.3 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Hand, J. Mark 

(B) REGISTRATION NUMBER: 36,545 

(C) REFERENCE/DOCKET NUMBER: 19959Y 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 732/594-3905 

(B) TELEFAX: 732/594-4720 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GGTGACCTAC GACATCCTCA AGGAGAAGCT GCTGGACTAC CACCTGCTCA CTGACAACTT 
CCCCTGCCAC TTTGTCTCTG CCTTTGGAGC CGGCTTCTGT GCCACAGTGG TGGCATCCCC 
GGTGGACGTG GTGAAGACCC GGTATATGAA CTCACCTCCA GGCCAGTACT TCAGCCCCCT 
CGACTGTATG ATAAAGATGG TGGCCCAGGA GCGCCACCAG CCTTCTACAA GGGATTTACA 
CCCTCCTTTT TGCGTTTGGG ATCCTGGAAC GTGGTGATGT TCGTAACCTA TGAGCAGCTG 
AAACGGGCCC TGATGAAAGT CCAGATGTTA CGGGAATCAC CGTTTTGAAC AAGACAAGAA 
GGCCACTGGT AGCTAACGTG TCCGAAACCA GTTAAGAATG GAAG 
(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
GGTGACCTAC GACATCCTCA AGGAGAAGCT GCTGGACTAC CACCTGCTCA CTGACAACTT 
CCCCTGCCAC TTTGTCTCTG CCTTTGGAGC CGGCTTCTGT GCCACAGTGG TGGCATCCCC 
GGTGGACGTG GTGAAGACCC GGTATATGAA CTCACCTCCA GGCCAGTACT TCAGCCCCCT 
CGACTGTATG ATAAAGATGG TGGCCCAGGA GCGCCACCAG CCTTCTACAA GGGATTTACA 
CCCTCCTTNT TTGCGTTTGG GATCCTGGAA CGTGGTGATG TTCGTAACCT ATGAGCAGCT 
GAAACGGGCC CTGATGAAAG TCCAGATGTT ACGGGAATCA CCGTTTTGAA CAAGACAAGA 
AGGCCACTGG TAGCTAACGT GTCCGAAACC AGTTAAGAAT GGAAG 
(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CGAGCAGCTG AAACGGGCCC TGATGAAAGT CCAGATGTTA CGGGNATCAC CGTTTTGAAC 60 

5 

AAGACAAGAA GGCCACTGGT AGCTAACGTN TCCGAAACCA GTTAAGATTG GAAGAAAACG 120 
GTCCATCCAC GNACACATGG ACACAGACCC ACACATNTT 159 
10 (2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



20 



40 



50 



55 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

25 TTTTTTTTGT TCTTACTCCC ACACCTAAGG TGGAANTTCT TTTATTGAGT CATAATAATT 60 

TCCCGAGAAT TCCGAGTCCT GCTACTTTAG GTTCTTGCCC AGGAATCCAC CTCTTTTCCC 120 

CCAAGCCCAA CAATCCTTTG AGGTACTCAT GATTGAGCGC GTGGTGGGGG GGGGTGGGGA 180 

30 

AGAGGCTGCA TGGGGGTGGG GCTCCTGTGG CTTCACGTCA TCCACTGTCA CCTCTGGTCC 240 

CCAAGTCTCT GGATCCTTTG GTCTCACCTC TAGACAACCG GCGGGGTTCA AACCTTCTTC 300 

35 CCTGGCAACT CCTCTCTGTC CCGACAAAAT CTCTCCCAAG GCATTGTCCT TGTAGTTAGA 3 60 

TTTACACAGA GCTTTTGCTT TTATAAAGTG CGTTCATGCC CAGCTTCTCA CTTGCATGTC 420 

ATAGCACCCC TGGTGAGGTG GACAGGGAAG GGATGGCTCC CTCCATTTTG TAGGAAAGTN 480 

GGGG 484 
(2) INFORMATION FOR SEQ ID NO: 5: 



45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = 'oligonucleotide' 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



AAGCTGCTGG ACTACCACCT GCTC 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide ■ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TACTGCCCTG GAGGTGAGTT CA 
(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
CCAAGCCCAA CAATCCTTTG A 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
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10 



15 



20 



CCAAAGGATC CAGAGACTTG G 21 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
TTGTGTGGAA TTGTGAGCGG ATAAC 25 
(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
30 (A) DESCRIPTION: /desc = "oligonucleotide" 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCAGGGTTTT CCCAGTCACG AC 22 
(2) INFORMATION FOR SEQ ID NO: 11: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TCGAACTCAC TCACCTCCCC TCTCACCTCA CTGCCCTCAC CAGCCAGCCT CTTGTCAAGT 60 
GATCAGGCTG TCAACCAACT TCTCTAGGAT AAGGTTTCAG GTCAGCCTGT GTGTATAAGA 120 



50 



55 
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CCAGTGCCAA GCCAGAAGCA GCAGAGACAA 
CCTGCTGCCA CCTCCTGGGA TGGAGCCCTA 

5 

GGACTCACAG CCCCACCGCT GCACTGAAGC 
CTCCTCTCGG CCCTAAAGGG ACTGGGCAGA 
10 TTCAGACGTG CCTCCCACCA TGGCTGTGAA 
TGCTGACCTC GTTACCTTTC CACTGGACAC 
GAACCAGGCG GTCCAGACGG CCCGGCTCGT 

15 

GACCATGGTG CGGACTGAGG GTCCCTGCAG 
GCGCCAGATG AGCTTCGCCT CCATCCGCAT 
20 CACCCCCAAA GGCGCGGACA ACTCCAGCCT 
AGGAGCCATG GCGGTGACCT GTGCCCAGCC 
CAGCATACAC CTCGGGCCAT CCAGGAGCGA 

25 

CAGAACCATC GCCAGGGAGG AAGGAGTCAG 
CATGAGGAAT GCTATCGTCA ACTGTGCTGA 
30 GCTGCTGGAC TATCACCTGC TCACTGACAA 
AGCCGGCTTC TGTGCCACAG TGGTGGCCTC 
GAACTCACCT CCAGGCCAGT ACTTCAGCCC 

35 

GGAGGGCCCC ACAGCCTTCT ACAAGGGATT 
GAACGTGGTG ATGTTCGTAA CCTATGAGCA 
40 GTTACGGGAA TCACCGTTTT GAACAAGACA 
ACCAGTTAAG AATGGAAGAA AACGGTGCAT 
GTTTACAGAA CTGTTGTTTA CTTGTTGCTG 

45 

TTCTGGTCTT CACTGCCATG CCTCAAGAAC 
TAAATTATAT TAATTTTTGA AACCCATTAG 
50 AACCAAGATA GATCCATTTG GACAAAATGG 
TGGTCAGATA ATGAATGATA AGCAGGAAGG 
ATGGAGTGGG AGATCAGCCA CGGAGCCTGG 

55 

GACTGCAGCT GCACCATCAC ACATGCATCA 




CAGTGAATGA CAAGGAGGGG CCATCCAATC 180 

GGGAGCCCCT GTGCTGCCCC TGCCGTGGCA 240 

CCAGGGCTGT GGAGCAGCTC TCTCCTTGGA 3 00 

GCCTTCCAGG ACTATGGTTG GACTGAAGCC 360 

GTTCCTGGGG GCAGGCACAG CAGCCTGTTT 420 

AGCCAAGGTC CGCCTGCAGA TCCAGGGGGA 480 

GCAGTACCGT GGCGTGCTGG GCACCATCCT 540 

CCCCTACAAT GGGCTGGTGG CCGGCCTGCA 600 

CGGCCTTTAC GACTCCGTCA AGCAGGTGTA 660 

CACTACCCGG ATTTTGGCCG GCTGCACCAC 720 

CACAGATGTG GTGAAGGTCC GATTTCAGGC 780 

CAGAAAATAC AGCGGGACTA TGGACGCCTA 840 

GGGCCTGTGG AAAGGAACTT TGCCCAACAT 900 

GGTGGTGACC TACGACATCC TCAAGGAGAA 960 

CTTCCCCTGC CACTTTGTCT CTGCCTTTGG 1020 

CCCGGTGGAC GTGGTGAAGA CCCGGTATAT 1080 

CCTCGACTGT ATGATAAAGA TGGTGGCCCA 1140 

TACACCCTCC TTTTTGCGTT TGGGATCCTG 1200 

GCTGAAACGG GCCCTGATGA AAGTCCAGAT 1260 

AGAAGGCCAC TGGTAGCTAA CGTGTCCGAA 1320 

CCACGCACAC ATGGACACAG ACCCACACAT 1380 

ATTCAAGAAA CAGAAGTAGA AGAGAGAGGA 1440 

ACCTTTGTTT TGCACTGACA AGATGGAAAA 1500 

GCATGCCTAA TATTTAGGCA AGAGAAAATA 1560 

AAGGTTGGAG ACGTGTATCC CCGTGAAATC 1620 

ATGGCAAGCA CGGGACAGGA GGGGCCCACA 1680 

AGGGACGCCA CCCAGCAACA CAGAGCTGGC 1740 

TCAGCCTATT TGTAATATGT CTGCCACAGA 1800 
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GAGTCCTTTG GGATTCTAGG AAACCCAAGG AACAAGAGAA AAAACTAGAG CCTGTGCTAA 1860 

AGAAGCCTGC TGGGCCCATG TGAGGCTGGG GTCGTAAATA TTCCCCGACG ACACTGAAGA 1920 

ATCAAGAGGG CAGCCCCCAC TTCTCCTACA AAATGGAGGG AGCCATCCCT TCCCTGTCCA 1980 

CCTCACCAGG GGTGCTATGA CATGCAAGTG AGAAGCTGGG CATGAACGCA CTTTATAAAA 2040 

GCAAAAGCTC TGTGTAAATC TAACTACAAG GACAATGCCT TGGGAGAGAT TTTGTCGGGA 2100 

CAGAGAGGAG TTGCCAGGGA AGAAGGTTTG AAAGATACGG TTGTCTAGAG GTGAGACCAA 2160 

AGGATCCAGA GACTTGGGGA CCAGAGGTGA CAGTGGATGA CGTGAAGCCA CAGGAGCCCC 2220 

ACCCCCATGC AGCTTTTTCC CCACCCCCCC CACCACGCGC TCAATCATGA GTACCTCAAA 2280 

GGATTGTTGG GCTTGGGGGA AAAGAGGTGG ATTCCTGGGC AAGAACCTAA AGTAGCAGGA 2340 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Met Val Gly Leu Lys Pro Ser Asp Val Pro Pro Thr Met Ala Val Lys 
1 5 10 15 

Phe Leu Gly Ala Gly Thr Ala Ala Cys Phe Ala Asp Leu Val Thr Phe 
20 25 30 

Pro Leu Asp Thr Ala Lys Val Arg Leu Gin lie Gin Gly Glu Asn Gin 
35 40 45 

Ala Val Gin Thr Ala Arg Leu Val Gin Tyr Arg Gly Val Leu Gly Thr 
50 55 60 

lie Leu Thr Met Val Arg Thr Glu Gly Pro Cys Ser Pro Tyr Asn Gly 
65 70 75 80 

Leu Val Ala Gly Leu Gin Arg Gin Met Ser Phe Ala Ser lie Arg lie 
85 90 95 

Gly Leu Tyr Asp Ser Val Lys Gin Val Tyr Thr Pro Lys Gly Ala Asp 
100 105 110 
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Asn Ser Ser Leu Thr Thr Arg lie Leu Ala Gly Cys Thr Thr Gly Ala 
115 120 125 

Met Ala Val Thr Cys Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe 
5 130 135 140 

Gin Ala Ser lie His Leu Gly Pro Ser Arg Ser Asp Arg Lys Tyr Ser 
145 150 155 160 

10 Gly Thr Met Asp Ala Tyr Arg Thr He Ala Arg Glu Glu Gly Val Arg 

165 170 175 



Gly Leu Trp Lys Gly Thr Leu Pro Asn He Met Arg Asn Ala He Val 

180 185 190 

Asn Cys Ala Glu Val Val Thr Tyr Asp He Leu Lys Glu Lys Leu Leu 

195 200 205 



Asp Tyr His Leu Leu Thr Asp Asn Phe Pro Cys His Phe Val Ser Ala 
20 210 215 220 

Phe Gly Ala Gly Phe Cys Ala Thr Val Val Ala Ser Pro Val Asp Val 
225 230 235 240 

25 Val Lys Thr Arg Tyr Met Asn Ser Pro Pro Gly Gin Tyr Phe Ser Pro 

245 250 255 



Leu Asp Cys Met He Lys Met Val Ala Gin Glu Gly Pro Thr Ala Phe 

260 ' 265 270 

Tyr Lys Gly Phe Thr Pro Ser Phe Leu Arg Leu Gly Ser Trp Asn Val 

275 280 " 285 



Val Met Phe Val Thr Tyr Glu Gin Leu Lys Arg Ala Leu Met Lys Val 
35 290 295 300 

Gin Met Leu Arg Glu Ser Pro Phe 
305 310 

40 (2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CATAGAATTC CAGGACTATG GTTGGAC 27 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CATTCTCGAG CTACCAGTGG CCTTCTTG 28 
20 (2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CCCGGATCGG ACTACTAGCA 20 
(2) INFORMATION FOR SEQ ID NO: 16: 



40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGGGGGAGGG CGTGAATGTA A 21 
(2) INFORMATION FOR SEQ ID NO: 17: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CCAGGAACAG CAGAGACAAC AGTGAATGGT GAGGCCCGGC CGTCAGATCC TGCTGCTACC 60 

TAATGGAGTG GATCCTTAGG GTGGCCCTGC ACTACCCAAC CTTGGCTAGA CGCACAGCTT 120 

CCTCCCTGAA CTGAAGCAAA AGATTGCCAG GCAAGCTCTC TCCTCGGACC TCCATAGGCA 180 

GCAAAGGAAC CAGGCCCATT CCCCGGGACC ATGGTTGGAC TTCAGCCCTC CGAAGTGCCT 240 

CCCACAACGG TTGTGAAGTT CCTGGGGGCC GGCACTGCGG CCTGTTTTGC GGACCTCCTC 300 

ACTTTTCCCC TGGACACCGC CAAGGTCCGT CTGCAGATCC AAGGGGAGAA CCCAGGGGCT 360 

CAGAGCGTGC AGTACCGCGG TGTGCTGGGT ACCATCCTGA CTATGGTGCG CACAGAGGGT 420 

CCCCGCAGCC CCTACAGCGG ACTGGTCGCT GGCCTGCACC GCCAGATGAG TTTTGCCTCC 480 

ATTCGAATTG GCCTCTACGA CTCTGTCAAG CAGTTCTACA CCCCCAAGGG AGCGGACCAC 540 

TCCAGCGTCG CCATCAGGAT TCTGGCAGGC TGCACGACAG GAGCCATGGC AGTGACCTGC 600 

GCCCAGCCCA CGGATGTGGT GAAGGTCCGA TTTCAAGCCA TGATACGCCT GGGAACTGGA 660 

GGAGAGAGGA AATACAGAGG GACTATGGAT GCCTACAGAA CCATCGCCAG GGAGGAAGGA 720 

GTCAGGGGCC TGTGGAAAGG GACTTGGCCC AACATCACAA GAAATGCCAT TGTCAACTGT 780 

GCTGAGATGG TGACCTACGA CATCATCAAG GAGAAGTTGC TGGAGTCTCA CCTGTTTACT 840 

GACAACTTCC CCTGTCACTT TGTCTCTGCC TTTGGAGCTG GCTTCTGTGC CACAGTGGTG 900 

GCCTCCCCGG TGGATGTGGT AAAG AC CCG A TACATGAACG CTCCCCTAGG CAGGTACCGC 960 

AGCCCTCTGC ACTGTATGCT GAAGATGGTG GCTCAGGAGG GACCCACGGC CTTCTACAAA 1020 

GGATTTGTGC CCTCCTTTCT GCGTCTGGGA GCTTGGAACG TGATGATGTT TGTAACATAT 1080 

GAGCAACTGA AGAGGGCCTT AATGAAAGTC CAGGTACTGC GGGAATCTCC GTTTTGAACA 1140 

AGGCAAGCAG GCTGCCTGGA ACAGAACAAA GCGTCTCTGC CCTGGGGACA CAGGCCCACA 1200 

CGGTCCAGAA CCCTGCACTG CTGCTGACAC GAGAAACTGA ACTAAAAGAG GAGAGTTTTA 1260 
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GTCCTCCGTG TTTCGTCCTA AAACACCTCT GTTTTGCACT GACCTGATGG GAAATAAATT 1320 

ATATTAATTT TTAAACCCTT TCCGGTTGGA TGCCTAACAT TTAGGCAAGA GACAACAAAG 1380 

AAAACCAGAG TCAACTCCCT TGAAATGTAG GAATAAAGGA TGCATAATAA ACAGGAAAGG 1440 

CACAGGTTTT GAGAAGATCA GCCCACAGTG TTGTCCTTGA ATCAAACAAA ATGGTCGGAG 1500 

GAACCCTTCG GGTTCAGCAC AAAGAGGTGA CTACAGCCTT TTGGTCACCA GATGACTCCG 1560 

CCCCTTTGTA ATGAGTCTGC CAAGTAGACT CTATCAAGAT TCTGGGGAAA GGAGAAAGAA 1620 

CACATTGACC TGCCCGGGCG GCCGCTCGAG CCCTATGA 1658 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 308 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Val Gly Leu Gin Pro Ser Glu Val Pro Pro Thr Thr Val Val Lys 
1 5 10 15 

Phe Leu Gly Ala Gly Thr Ala Ala Cys Phe Ala Asp Leu Leu Thr Phe 
20 25 30 

Pro Leu Asp Thr Ala Lys Val Arg Leu Gin lie Gin Gly Glu Asn Pro 
35 40 45 

Gly Ala Gin Ser Val Gin Tyr Arg Gly Val Leu Gly Thr lie Leu Thr 
50 55 60 

Met Val Arg Thr Glu Gly Pro Arg Ser Pro Tyr Ser Gly Leu Val Ala 
65 70 75 80 

Gly Leu His Arg Gin Met Ser Phe Ala Ser lie Arg lie Gly Leu Tyr 
85 90 95 

Asp Ser Val Lys Gin Phe Tyr Thr Pro Lys Gly Ala Asp His Ser Ser 
100 105 110 

Val Ala He Arg He Leu Ala Gly Cys Thr Thr Gly Ala Met Ala Val 
115 120 125 

Thr Cys Ala Gin Pro Thr Asp Val Val Lys Val Arg Phe Gin Ala Met 
130 135 140 
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He Arg Leu Gly Thr Gly Gly Glu Arg Lys Tyr Arg Gly Thr Met Asp 
145 150 155 160 

Ala Tyr Arg Thr He Ala Arg Glu Glu Gly Val Arg Gly Leu Trp Lys 
165 170 175 

Gly Thr Trp Pro Asn He Thr Arg Asn Ala He Val Asn Cys Ala Glu 
180 185 190 

Met Val Thr Tyr Asp He He Lys Glu Lys Leu Leu Glu Ser His Leu 
195 200 205 



Phe Thr Asp Asn Phe Pro Cys His Phe Val Ser Ala Phe Gly Ala Gly 
15 210 215 220 

Phe Cys Ala Thr Val Val Ala Ser Pro Val Asp Val Val Lys Thr Arg 

. 225 230 235 240 

20 Tyr Met Asn Ala Pro Leu Gly Arg Tyr Arg Ser Pro Leu His Cys Met 

245 250 255 



Leu Lys Met Val Ala Gin Glu Gly Pro Thr Ala Phe Tyr Lys Gly Phe 
260 265 270 

Val Pro Ser Phe Leu Arg Leu Gly Ala Trp Asn Val Met Met Phe Val 

275 280 285 



Thr Tyr Glu Gin Leu Lys Arg Ala Leu Met Lys Val Gin Val Leu Arg 
30 290 295 300 

Glu Ser Pro Phe 
305 

35 (2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE : nucleic acid 
40 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "degenerate oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCNCTGGAYA CNGCYAA 17 
(2) INFORMATION FOR SEQ ID NO: 20: 



55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc = "degenerate oligonucleotide" 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CAGCCCACNG ANGTNGT 17 
15 (2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



30 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "degenerate oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TTCACCACRT CNACNGG 17 
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